Using accent-specific pronunciation modelling for robust speech recognition
نویسندگان
چکیده
A method of modelling accent-specific pronunciation variations is presented. Speech from an unseen accent group is phonetically transcribed such that pronunciation variations may be derived. These context-dependent variations are clustered in a decision tree which is used as a model of the pronunciation variation associatedwith this new accent group. The tree is then used to build a new pronunciation dictionary for use during the recognition process. Experiments are presentedfor the recognition of Lancashire& Yorkshire accented speech using a recognizer trained on London & South East England speakers. The results show that the addition of accent-specific pronunciations can reduce the error rate by almost 20% for cross accent recognition. It is also shown that worthwhile gains in performance can be obtained using only a small amount of accent-specific data.
منابع مشابه
Pronunciation variation modelling using accent features
In this paper, we propose a novel method for modelling native accented speech. As an alternative to the notion of dialect, we work with the lower level phonological components of accents, which we term accent features. This provides us with a better understanding of how pronunciation varies and it allows us to give a much more detailed picture of a person’s speech. The accent features are inclu...
متن کاملUsing accent-specific pronunciation modelling for improved large vocabulary continuous speech recognition
A method of modelling accent-specific pronunciation variations is presented. Speech from an unseen accent group is phonetically transcribed such that pronunciation variations may be derived. These context-dependent variations are clustered in decision trees which are used as a model of the pronunciation variation associated with this new accent group. The trees are then used to build a new pron...
متن کاملAcoustic and Lexical Modeling Techniques for Accented Speech Recognition
Speech interfaces are becoming pervasive among the common public with the prevalence of smart phones and cloud-based computing. This pushes Automatic Speech Recognition (ASR) systems to handle wide range of environments including different channels, noise conditions and speakers with varying accents. This thesis focuses on the impact of speakers’ accents on the ASR models and techniques to make...
متن کاملCombined acoustic and pronunciation modelling for non-native speech recognition
In this paper, we present several adaptation methods for nonnative speech recognition. We have tested pronunciation modelling, MLLR and MAP non-native pronunciation adaptation and HMM models retraining on the HIWIRE foreign accented English speech database. The “phonetic confusion” scheme we have developed consists in associating to each spoken phone several sequences of confused phones. In our...
متن کاملThe use of accent-specific pronunciation dictionaries in acoustic model training
Speech recognition systems are increasingly being built to cover an ever wider range of speaker accents. However, electronically available pronunciation dictionaries (PDs) specific to these accents often do not exist and would be time consuming and expensive to build by hand. This paper explores the use of pronunciation modelling for the synthesis of accent-specific PDs directly from acoustic d...
متن کامل